Benfords Law (Operator Toolbox)

Synopsis

This operator tests, if columns in your data set are following Benford's Law

Description

Benford's Law states, that for real world applications often the distribution of leading digits of numbers is not uniform. Smaller digits (like 1 or 2) appear more often than larger digits. For more information please see: https://en.wikipedia.org/wiki/Benford's_law

This operator checks, if the first digit of the values ina numerical column follow this law. It first counts how often a leading digit occurs and calculates the frequency. This frequency is returned at the frequency port. Also the expected frequency according to Benford's law is reported.

In a second step a p-value is calculated. This value indicates how likely it is, that the observed distribution of leading digits is compatible with Benford's law. Low numbers mean, that the distribution is different to what's expected.

Input

  • exa (Data table)

    The input example set

Output

  • frequency (Data table)

    A table with the frequency of the first digit in the data set and the expected frequency if the data set would follow Benford's law.

  • pvalue (Data table)

    The table with the pvalue ("probability") that a column follows Benford's Law

  • ori (Data table)

    The original data set

Parameters

    Tutorial Processes